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Higher-order cis-acting RNA replication structures have been identified in the 3’- and 5’-terminal untranslated 
regions (UTRs) of a bovine coronavirus (BCoV) defective interfering (DI) RNA. The UTRs are identical to those in 
the viral genome, since the 2.2-kb DI RNA is composed of only the two ends of the genome fused between an internal 
site within the 738-nucleotide (nt) 5’-most coding region (the nsp1, or p28, coding region) and a site just 4 nt 
upstream of the 3’-most open reading frame (ORF) (the N gene). The joined ends of the viral genome in the DI RNA 
create a single continuous 1,635-nt ORF, 288 nt of which come from the 738-nt nsp1 coding region. Here, we have 
analyzed features of the 5’-terminal 288-nt portion of the nsp1 coding region within the continuous ORF that are 
required for DI RNA replication. We observed that (i) the 5’-terminal 186 nt of the nsp1 coding region are necessary 
and sufficient for DI RNA replication, (ii) two Mfold-predicted stem-loops within the 186-nt sequence, named SLV 
(nt 239 to 310) and SLVI (nt 311 to 340), are supported by RNase structure probing and by nucleotide covariation 
among closely related group 2 coronaviruses, and (iii) SLVI is a required higher-order structure for DI RNA 
replication based on mutation analyses. The function of SLV has not been evaluated. We conclude that SLVI within 
the BCoV nsp1 coding region is a higher-order cis-replication element for DI RNA and postulate that it functions 


similarly in the viral genome. 


cis-acting RNA signaling elements that function directly in 
the replication of positive-strand viral genomes have been of- 
ten first identified in helper virus-dependent defective or de- 
fective interfering (DI) RNAs (18, 26). One advantage in using 
DI RNAs for this identification is that cis-acting signals for 
replication can be evaluated independently from other func- 
tions for the same structure. Such functions might include 
encoding a trans-acting factor or regulating the synthesis of a 
trans-acting factor (e.g., regulation of translation to produce a 
replicase). With the use of a 2.2-kb DI RNA of the bovine 
coronavirus (BCoV), a molecule composed of only the two 
ends of the viral genome (8), primary and higher-order cis- 
replication structures have been identified in both the 3’ and 5’ 
untranslated regions (UTRs) (described below). Other cis-rep- 
lication structures in these regions, identified first in the closely 
related mouse hepatitis coronavirus (MHV) genome, have also 
been found in the BCoV DI RNA (described below). 

Three cis-replication elements have been identified in the 
289-nucleotide (nt) BCoV 3’ UTR: (i) a bulged stem-loop 
beginning just downstream from the stop codon of the N gene 
(20, 21) and an adjacent pseudoknot (45), which possibly to- 
gether act as a single higher-order molecular switch (14). The 
bulged stem-loop was first identified in the MHV genome 
(20-22). An important part of the switch concept derives from 
the existence of a 5-nt sequence that can function as a com- 
ponent of either the bulged stem-loop or the pseudoknot, 
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but not both simultaneously (14). The bulged stem-loop-— 
pseudoknot structure appears as a common feature among 
group 2 coronaviruses and has been analyzed in most detail in 
the context of the MHV genome (14). (11) A complex bulged 
stem-loop near the 3’ terminus of the 3’ UTR, which contains 
the coronavirus common GGAAGAGC octamer sequence 
and an adjacent helical region (H. Y. Wu and D. A. Brian, 
unpublished). This structure was first identified in an MHV DI 
RNA (47). (iii) A 3’-terminal poly(A) tail (42; S. D. Senanay- 
ake and D. A. Brian, unpublished). Interestingly, to date only 
cellular proteins have been shown to interact with the 3’ UTR 
structures, and of these, only the 3’-terminal octamer-associ- 
ated bulged stem-loop region (32, 47, 48) and the poly(A) tail 
(42) have been shown to be targets. The cis-acting functions of 
the 3’ UTR elements may be common among the group 2 
coronaviruses, since the BCoV DI RNA can replicate with the 
helper function of several group 2 coronaviruses (46), and the 
entire 3’ UTR of the BCoV and of human severe acute respi- 
ratory syndrome-associated coronavirus (SARS-CoV) can 
functionally replace the 3’ UTR in the MHV genome (16, 21). 

Three cis-replication elements have been identified in the 
210-nt BCoV 5’ UTR. (1) The 5’-terminal 90-nt region harbors 
stem-loops SLI (nt 11 to 42) and SLI (nt 51 to 84) as predicted 
by the Tinoco algorithm and as supported by structure probing 
(8, 9). 5’-terminal deletions of up to 13 nt kill DI RNA repli- 
cation (8), and other mutations made within the first 90 nt 
rapidly revert to the wild-type (wt) sequence (9) by a high- 
frequency recombination event with the helper virus genome, 
known as leader switching (28). Thus, it appears that the se- 
quence and/or the stem-loops are a cis-acting feature. A recent 
report described two stem-loops in the region of nt 1 to 51 and, 
of these, the second functions as a cis-replication element in 
the context of the MHV genome (24). (ii) SLIT (nt 97 to 116) 
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is predicted by both the Tinoco and Zuker algorithms, is sup- 
ported by RNase structure probing, and appears to have a 
homolog in coronavirus groups 1 to 3 (34). In essentially all 
coronaviruses examined, a translation initiation codon for a 
short intra-5’ UTR open reading frame (ORF) of unknown 
function is associated with SLIHI (34). Base covariations in the 
helical stem indicate that SLII is phylogenetically conserved 
among group 2 coronaviruses (34 and data not shown). (iil) 
SLIV (nt 186 to 215) is a cis-acting phylogenetically conserved 
structure in group 2 coronaviruses with the exception of SARS- 
CoV (35). The structure of SLIV in SARS-CoV appears sim- 
ilar to those predicted for group 1 coronaviruses (35). 

During initial studies to determine the structural require- 
ments for BCoV DI RNA replication, it was learned that 
translation of the continuous ORF within the DI RNA was 
necessary for replication (7). Beyond this, it has not been 
determined how the partial nsp1 coding sequence functions in 
DI RNA replication. Interestingly, an analogous region of the 
nsp1 coding sequence is found in all naturally occurring and in 
one synthetically created coronavirus DI RNAs described to 
date (reviewed in reference 3). In addition, in the context of 
the MHV genome, a deletion of the 5’-proximal region of the 
nsp1 coding sequence, but not of the 3’-proximal region, pre- 
vents genome replication as determined through reverse ge- 
netics analyses (6). The mechanistic contribution of this se- 
quence was not determined, but it was speculated to be 
perhaps a function of the protein product, an RNA structure, 
or both (6). 

In this study, we have examined the function of the 5’- 
proximal 288 nt of the nsp1 coding region in the DI RNA ORF 
that provides replication competence to the BCoV DI RNA. 
We observed that (i) the 5’-terminal 186 nt of the nsp1 coding 
region are necessary and sufficient for DI RNA replication, (11) 
within the 186-nt sequence are found two Mfold-predicted 
stem-loops, named SLV (nt 239 to 310) and SLVI (nt 311 to 
340), that are supported by RNase structure probing and by 
nucleotide covariations that maintain the helical stems among 
the closely related group 2 coronaviruses, and (iii) SLVI is a 
required higher-order structure for DI RNA replication on the 
basis of mutation analyses. The function of SLV has not been 
evaluated. We conclude that SLVI within the BCoV nsp1 cod- 
ing region acts as a higher-order cis-replication element for DI 
RNA and postulate that it functions similarly in the viral ge- 
nome. 


MATERIALS AND METHODS 


Cells, virus, and DI RNA. A DI RNA-free stock of the Mebus strain of BCoV 
(genome sequence; GenBank accession no. U00735) at 4.5 < 10° PFU/ml was 
used as a helper virus as described previously (7, 8). The human rectal tumor cell 
line HRT-18 (43) was used in all experiments. pDrep1 is a pGEM3Zf(-) (Pro- 
mega)-based plasmid containing the cDNA clone of a naturally occurring 2.2-kb 
DI RNA of BCoV modified to carry a 30-nt in-frame reporter (Fig. 1A) (8). 

Enzyme structure probing of in vitro-transcribed RNA. RNA secondary struc- 
ture modeling was performed using Mfold version 3.2 at http:/www.bioinfo.rpi 
.edu/applications/mfold/ (31, 49). Enzyme structure probing was carried out as 
described elsewhere (37). Briefly, for in vitro synthesis of RNA, 10 pg of Xbal- 
linearized mung bean nuclease blunt-ended pDrep1 DNA (Fig. 1A) in a 200-pl 
reaction volume was transcribed at 37°C for 1 h using 80 U of T7 RNA poly- 
merase (Promega). A yield of ~40 wg of a 595-nt-long vector-free transcript was 
obtained. RNA was treated with 10 U of RNase-free DNase (Promega) at 37°C 
for 30 min, extracted with phenol-chloroform, chromatographed through a Bio- 
spin 6 column (Bio-Rad), spectrophotometrically quantified, and stored in water 
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at —20°C. For RNase treatments, 40 wg of RNA was heat denatured at 65°C for 
3 min and renatured by slow cooling (0.5 h) to 35°C in a 400-pl reaction volume 
containing 30 mM Tris HCl (pH 7.5)-20 mM MgCl,-300 mM KCl, and aliquots 
containing 2 wg of sample RNA and 10 pg of yeast tRNA were incubated in 100 
wl of the same buffer and 1.0, 0.1, 0.05, 0.01, or 0.001 U of RNase CV1 (Phar- 
macia) or 5.0, 1.0, or 0.1 U of RNase T, (GIBCO), as noted below in the legend 
to Fig. 3. RNase digestion was carried out at 25°C for 15 min and terminated by 
the addition of 150 pl of 0.5 M sodium acetate, after which the RNA was 
extracted with phenol-chloroform and ethanol precipitated. Digested RNA prep- 
arations were used in primer extension reactions with 5’-end-labeled plus-strand- 
binding primer 1 (BCoV332-352 for mapping SLV) or primer 2 (BCoV379-400 
for mapping SLVI) (Table 1). Undigested RNA was used with the same primers 
to identify natural reverse transcription stops and in dideoxynucleotidyl sequenc- 
ing reactions to mark nucleotide positions. Products were analyzed on a DNA 
sequencing gel of 6% polyacrylamide. 

Construction of mutant DI RNAs. Plasmids from which mutant DI RNAs were 
transcribed in vitro were modifications of pDrep1 (8). Modifications of pDrep1 
were made by overlap PCR mutagenesis (19, 38, 45) using pDrep1 DNA tem- 
plate and the oligonucleotide primers described in Table 1. Restriction endonu- 
clease sites used for final construction of the mutant plasmids are shown in Fig. 
1A. Mutated regions were confirmed by sequencing. 

To make p77Drep, the 210-nt 5’ UTR in pDrep1 was shortened by overlap 
mutagenesis wherein the products from primers PGEM3Z2097(-), BCoV47- 
74(+), DI211-232(-), and oligo 2(+) were extended, and the mutated 1,158-nt 
NdeI fragment from the extended product (the upstream Ndel site is within the 
vector) was used to replace the analogous fragment in pDrepl. To make 
p210Nrep, pNrep2 (8) was digested with BglII, and the resulting 754-nt BglII 
fragment was used to replace the analogous fragment in pDrep1. 

3’-end deletions of the 288-nt partial nsp1 coding region within pDrep1 were 
made by overlap PCR mutagenesis wherein primers Nstart(-) and oligo 2(+) 
were used to make the 3’-proximal overlapping fragment and _ primers 
PGEM3Z2477(-) and the indicated mutant primers (Table 1) were used to make 
the 5’-proximal overlapping fragments of differing lengths. The mutated Hpal- 
Xbal fragments from the final extended PCR products were used to replace the 
analogous fragment in pDrep1l. The name of each deletion mutant corresponds 
to the primer used with the same name. 

pLeftVI was made by PCR _ overlap mutagenesis wherein primers 
PGEM3Z2477(-), Leftvi(+), Leftvi(-), and oligo 2(+) were used to make an 
overlap PCR product from which the mutated 1,158-nt NdeI fragment was used 
to replace the analogous fragment in pDrepl. pRightVI was made in the same 
way except for the use of primers Rightvi(+) and Rightvi(-), respectively. 

In vitro translation. For in vitro translation, ~2 wg of transcript was translated 
in a 50-pl reaction mix containing 35 pl of rabbit reticulocyte lysate (Promega) 
and 20 wCi of [?°S]methionine (>800 Ci/mmol; MP Biochemicals) as recom- 
mended by the manufacturer. Radiolabeled proteins were resolved by sodium 
dodecyl sulfate-polyacrylamide gel electrophoresis in gels of 12% polyacrylamide 
(Tris-HCl Ready gel; Bio-Rad) by the method of Laemmli (25), and dried gels 
were exposed to Kodak XAR-5 film for imaging. 

Northern assay for DI RNA replication. The Northern assay for detecting 
reporter-containing DI RNAs was performed as described previously (8, 33). 
Briefly, 5 wg of Mlul-linearized plasmid DNA was blunt ended with mung bean 
nuclease and transcribed with 40 U of T7 RNA polymerase (Promega) in a 
100-1 reaction mixture to produce uncapped DI RNA. The reaction mix was 
treated with 5 U of RNase-free DNase (Promega), and RNA was chromato- 
graphed through a Biospin 6 column (Bio-Rad) and quantitated spectrophoto- 
metrically. Cells (~6 X 10°) at ~80% confluence in a 35-mm dish were infected 
with BCoV at a multiplicity of 10 PFU per cell and transfected 1 h later with 1 
wg of RNA using Lipofectin (Bethesda Research Laboratories). At the indicated 
times postinfection, RNA (approximately 10 wg per plate) was extracted by the 
Nonidet P-40-proteinase K method (8) and stored as an ethanol precipitate. For 
passage of progeny virus, supernatant fluids were harvested at 48 h postinfection 
(hpi), and 500 pl was used to infect freshly confluent cells (~8 X 10°) in a 35-mm 
dish from which RNA was extracted at 48 hpi. For electrophoretic separation of 
RNA in a formaldehyde-agarose gel, 2.5 wg per lane was used. Approximately 1 
ng of transcript, identified as RNA in the Northern blot figures, was loaded per 
lane when used as a marker. RNA was transferred to Nytran membrane by 
vacuum blotting, and the UV-irradiated blots were probed with oligonucleotide 
TGEV(+) that had been 5’-end labeled with **P to specific activities of 1 x 10° 
to 4 X 10’ cpm/pmol. Probed blots were exposed to Kodak XAR-5 film for 1 to 
7 days at —80°C for imaging. 

Reverse transcription-PCR and sequence analysis of mutant DI RNA repli- 
cons. With RNA extracted from cells at 24 h postinfection with VP1 virus 
collected from mutant transfections, reverse transcription was done using DI 
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FIG. 1. Location of cis-replication elements within the partial nsp1 coding region in BCoV DI RNA. A. Structure of the naturally occurring 
BCoV DI RNA relative to the full-length BCoV genome. The DI RNA is composed of the fused ends of the viral genome as described in the text. 
The 65-nt leader is illustrated by a filled rectangle. The partial nsp1 sequence in parentheses represents the 5’-proximal 288 nt of the 738-nt nsp1 
coding region. The cloned, modified BCoV DI RNA is under control of the T7 RNA polymerase promoter, carries a 30-nt in-frame reporter used 
for Northern probing, and is named pDrep1. Restriction endonuclease sites used for further engineering as described in this report are depicted. 
The cloned, modified BCoV N mRNA (mRNA 7) is under control of the same promoter as pDrep1, carries the same reporter, and is named 
pNrep2. The difference in the RNA transcripts from the two plasmids both linearized at the Mlul site is a continuous sequence of 421 nt as 
depicted. At the right are shown the results of Northern analyses depicting the accumulation of pDrep1 RNA at 48 and 96 h posttransfection and 
at 24 hpi following first virus passage (VP1), and the absence of accumulation of the pNrep2 RNA. RNA, ~1 ng of transcript used for transfection. 
B. Structures of modified pDrep1 that carry the 5’ UTR of mRNA 7, named p77Drep, and the 5’ UTR of the genome but without the 288-nt region 
of nsp1, named p210Nrep. At the right are Northern analyses depicting the accumulation of wt DI RNA and the absence of accumulation for RNAs 
from p77Drep and p210Nrep. RNA, ~1 ng of transcript used for transfection. C. Deletion analysis of the 288-nt partial nsp1 coding region in BCoV 
DI RNA. 3’-terminal deletions of the partial nsp1 coding region were made as indicated, and RNAs from the mutants were tested for 
accumulation. The results of Northern analyses at the right illustrate that DI RNAs carrying the first 186 nt of nsp1 coding region accumulated, 
whereas those with less than 186 nt did not. RNA, ~1 ng of transcript used for transfection. 


SOeIGI] egolue Jo Alisuaniup AG GLOZ ‘SG eunr UO /Bio‘Wwse'IAl//:duy Wood) papeo|uUMOG 


VoL. 81, 2007 cis-REPLICATION STEM-LOOP IN BCoV nsp1 CODING REGION — 7719 
TABLE 1. Oligonucleotides used in this study 

Oligonucleotide Polarity” Sequence (5’—3") ee ” 
1aA217-498 (+) = GAGTAAAAGACAT/CGACATTGTG 207-216/499-511 
1aA238-498 (+) = GAGTAAAAGACAT/GAGACCGTATTTG 225-—237/499-511 
1aA283-498 (+) = GAGTAAAAGACAT/TCTGCGTCCTC 274-282/499-511 
1aA319-498 (+) = GAGTAAAAGACAT/CACCTCTGAAC 308-318/499-511 
1aA358-498 (+) = GAGTAAAAGACAT/TGTTTCCAGC 348-357/499-511 
1aA397-498 (+) = GAGTAAAAGACAT/ACAATCCACCATC 384—396/499-511 
1aA409-498 (+) = GAGTAAAAGACAT/AAGAAGTCGGC 398-408/499-511 
1aA424-498 (+) = GAGTAAAAGACAT/ACAACACTCTTG 412-423/499-511 
1aA442-498 (+) = GAGTAAAAGACAT/TATTAGGCTAG 431-441/499-511 
1aA454-498 (+) = GAGTAAAAGACAT/AACAATTTCACG 442—453/499-511 
1aA478-498 (+) = GAGTAAAAGACAT/CAAATCATATGGAC 464-477/499-511 
BCoV47-74 (+) oe TAAAGTTTAGATTATAAAAAGATCTAAC 47-74 
BCoV332-352 (+) = CCAGCTTTTGCGCAGTGGTGG 332-352 
BCoV379-400 (+) sa GGCGACAATCCACCATCACATG 379-400 
DI211-232 (=) a CGTATTTGTTGATCTTCGACAT/CCTTAAAGTTTAG 65-74/211-232 
Leader7-32 (—) a GAGCGATTTGCGTGCGTGCATCCCGC 7-32 
LeftVI (—) a GTAGTTCAGAAGTCGATATAGTATG 305-329 
LeftVI (+) = CATACTATATCGACTTCTGAACTAG 305-329 
Nstart (—) se ATGTCTTTTACTCCTGGTAAGC 499-520 
Oligo2 (+) = GTCCCGATCGAGAATGTCAGCCGGGG 968-993 
PGEM3Z2097 (—) or AGGGCGACACGGAAATGTTG (2097-2116) 
PGEM3Z2477 (—) ci GGCATCAGAGCAGATTGTACTG (2477-2498)4 
RightVI (—) a GATATAGTATGCTCGACTACTGCGC 319-343 
RightVI (+) = GCGCAGTAGTCGAGCATACTATATC 319-343 
TGEV 8 (+) = CATGGCACCATCCTTGGCAACCCAGA 1098-1123 


“The positive and negative symbols in oligonucleotide names indicate the polarity of the nucleic acid to which the oligonucleotide anneals. 


> Polarity of the oligonucleotide relative to the positive-strand DI RNA. 


“ For probe binding to negative-strand sequence, the numbers given correspond to complementary positive-strand sequence. 
¢ Binds in the pGEM3Zf(-) vector upstream of the DI RNA transcription start site. 


RNA-specific primer TGEV(+), PCR was done using primers TGEV(+) and 
leader(-), and the PCR product was sequenced directly. 


RESULTS 


The 288-nt partial nsp1 coding region provides a required 
function for BCoV DI RNA replication. Previous studies had 
demonstrated that a T7 RNA polymerase-generated ssgmRNA 
7 (mRNA for the N protein) of BCoV containing a short 
reporter sequence does not replicate when transfected into 
BCoV-infected cells, whereas the DI RNA containing the 
same reporter sequence does (8). The difference between the 
two molecules is a continuous stretch of 421 nt that is made up 
of some of the genomic 5’ UTR sequence and all of the 288-nt 
partial nsp1 region (Fig. 1A). To test whether the genomic 5’ 
UTR and the partial nsp1 coding region independently fulfill 
requirements for replication of the DI RNA, constructs were 
made to test each domain separately. In the first, the 210-nt DI 
RNA 5’ UTR was precisely replaced with the 77 5’ UTR of 
semRNA 7, resulting in p77Drep, which lacks 136 nt of the 
genomic 5’ UTR but retains the 288-nt partial nsp1 coding 
region. Transcripts of p77Drep were then tested for replication 
in helper virus-infected cells (Fig. 1B). Accumulation of DI 
RNA by 24 and 48 h posttransfection, and by 24 hpi following 
infection with first virus passage (VP1), were used as evidence 
of DI RNA replication (8, 34, 35, 45). As shown in Fig. 1B, this 
molecule failed to replicate, indicating signals necessary for 
replication are missing. This result has been confirmed and 
extended by experiments showing requirements for both SLII 
and SLIV (mapping between nt 97 and 210) in DI RNA rep- 
lication (34, 35). In the second construct, the 77-nt 5’ UTR of 


the reporter-containing mRNA 7 construct (pNrep2) (8) was 
precisely replaced with the 210-nt genomic 5’ UTR, forming 
p210Nrep. From this plasmid, a DI RNA-like molecule was 
made that had the 210-nt genomic 5’ UTR and all the N- 
coding domain and reporter sequence but was missing the 
entire 5’-proximal 288 nt of the nsp1 coding domain. Figure 1B 
illustrates that a transcript of p210Nrep also failed to replicate 
in helper virus-infected cells, thus demonstrating a require- 
ment for all or part of the 288-nt partial nsp1 coding region in 
the DI RNA for replication. 

cis-acting signals for DI RNA replication map within the 
5'-terminal 186 nt of the partial nsp1 coding region in the DI 
RNA, a region containing SLV and SLVI. To characterize the 
cis-acting features within the 288-nt partial nsp1 coding region, 
conserved sequences and higher-order structures among eight 
group 2 coronaviruses previously studied (BCoV-Mebus, hu- 
man coronavirus [HCoV]-OC43, HCoV-4408, porcine 
hemagglutinating encephalomyocarditis [HEV]-TN11, equine 
coronavirus [ECoV]|-NC99, classed as BCoV-like, and mouse 
hepatitis coronavirus [MHV-A59, MHV-2, and MHV-JHM], 
classed as MHV-like [46]), were identified and treated as po- 
tential cis-acting elements. For this, sequences were aligned 
(46) and the Mfold program of Zuker applied to each. Four 
predicted hairpin stem-loops showing nucleotide variations in 
the helical stems among the eight viruses were identified and 
named SLV through SLVIII (Fig. 2). This nomenclature con- 
tinues that used for BCoV 5’-proximal stem-loops (7, 8, 34, 
35). In BCoV, SLV maps at nt 239 to 310, SLVI at nt 311 to 
340, SLVII at nt 356 to 373, and SLVIII at nt 378 to 446, and 
the calculated free energies are —27.1, —10.9, —2.9, and —26.4 
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FIG. 2. Stem-loops in the 5’-proximal 288 nt of the 738-nt nsp1 coding region as predicted by Mfold. Named nucleotides are those in BCoV. 
Unless otherwise indicated, they are also the nucleotides in the other named group 2 coronaviruses and are highlighted in gray. Arrows identify 
bases in MHV-A59, MHV-2, and MHV-JHM unless otherwise identified by superscipts. The compared group 2 coronaviruses and their superscript 
designations, if any, are BCoV-Mebus, HCoV-OC43 (0), HCoV-4408, HEV-TN11 (h), ECoV-NC99 (e), MHV-A59 (a), MHV-2 (2), and 
MHV-JHM (j). The free energies noted refer to the stem-loops in the BCoV RNA. 


kCal/mol, respectively (Fig. 2). Identical bases aligning among 
the eight coronaviruses are shown in Fig. 2, and those that 
differ are identified. Although the SARS-CoV has been clas- 
sified as a group 2 coronavirus (41), structural homologues of 
stem-loops V to VIII in SARS-CoV were not apparent by 
these analyses (data not shown). 

To determine whether the entire nsp1 portion present in the 
DI RNA molecule is required for DI RNA replication, a set of 
3'-terminal in-frame deletions of this region in pDrep1 was 
made by overlap PCR mutagenesis, and T7 RNA polymerase- 
generated transcripts of each mutant were tested for replica- 
tion in helper virus-infected cells. Figure 1C summarizes the 
data showing that deletion mutants 1a4478-498 through 
1aA397-498 replicated, as evidenced by accumulation of DI 
RNA after VP1 and VP2, whereas 1aA358-498 through 
1aA217-498 did not. A low level of DI RNA accumulation 
prior to virus passage posttransfection (for example, the unde- 
tectable amounts for mutant p454-498 at 24 h and 48 h post- 
transfection in Fig. 1C) may reflect structural influences on 
replication, but we noted that amounts of accumulation at 
these times did vary widely, even for wt DI RNA (e.g., compare 
Fig. 1 and 4, below). Thus, for DI RNA replication, the 3’- 
terminal 102 nt of the 288-nt partial nsp1 coding region, which 
contains most of SLVIII, are not needed. It was therefore 
concluded that the SLV- to SLVII-containing 5’-proximal 
186-nt region contains one or more cis-acting elements re- 
quired for DI RNA replication. Of these, SLV and SLVI were 


chosen for further analysis since they possessed relatively low 
free energies and demonstrated sequence conservation in the 
stems. 

RNase structure maps are consistent with predicted stem- 
loops V and VI. To determine whether stem-loops V and VI as 
predicted by Mfold are supported by enzyme structure probing, 
synthetic transcripts of the first 288 nt of the nsp1 coding 
region were probed by methods used previously to analyze 
stem-loops I through IV and the 3’-proximal pseudoknot (9, 
34, 35, 45). For this, pDrep1 was linearized at the Xbal site and 
T7 RNA polymerase-generated transcripts were digested with 
various concentrations of RNase specific for single-stranded 
regions (RNase T, [nonspecific with a preference for A]) and 
helical regions (cobra venom RNase V1). The positions of 
cleavage sites in the RNA (summarized in Fig. 3A) were de- 
termined by analyzing primer extension products in parallel 
with a dideoxy sequencing ladder generated from untreated 
RNA (Fig. 3B and C). Primer extension products of uncut 
RNA were also examined to identify natural reverse transcrip- 
tion stops. The results showed that for SLV via extension of 
primer 1 (Fig. 3B), strong single-strand hits were present at nt 
227 to 230, a predicted single-stranded region between SLIV 
and SLV, at nt 268 and 269, sites within and adjacent to a 
predicted 2-nt bulge, and at nt 273 to 276, sites within the 
predicted 4-nt terminal loop. In addition, weak single-strand 
hits were observed at nt 267, a site within a 2-nt bulge, at nt 
272, a site adjacent to the terminal loop, at nt 282, a site 
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FIG. 3. Enzyme structure probing of SLV and SLVI in BCoV. A. Schematic summary of structure probing data. B. Electrophoretic analysis 
of digestion products in the SLV region from extensions of primer 1. Lanes 1 through 4, sequencing ladder generated from primer 1; lanes 5 
through 8, RNase CV, digestion with 1.0, 0.1, 0.05, and 0.01 U, respectively; lanes 9 through 11, RNase T, digestion with 5.0, 1.0, and 0.1 U, 
respectively; lane 12, undigested RNA. C. Electrophoretic analysis of digestion products in the SLVI region from extensions of primer 2. Lanes 
1 through 4, sequencing ladder generated from primer 2; lanes 5 through 7, RNase CV, digestion with 0.1, 0.01, and 0.01 U, respectively; lanes 


8 and 9, RNase T, digestion with 1.0 and 0.1 U, respectively. 


adjacent to an internal loop, and at nt 298 and 299, sites within 
a predicted internal loop. Strong double-strand hits were 
present at nt 222 to 226, a previously unmapped region that 
appears to extend the length of SLIV described earlier (35) (to 
be analyzed further elsewhere). Strong double-strand hits were 
also present at nt 242 and 243, a predicted double-stranded 
region, at nt 263 and 264, adjacent to and within an internal 
loop, and at nt 277, adjacent to the terminal loop. In addition, 
weak double-strand hits were observed within stems at nt 255, 
262, 265, 269, 270, 278, 288, and 310 and within loop regions at 


nt 258 and 268. Double-strand hits in predicted loop regions 
may be explained by the fact that CV1 on occasion digests 
within single-stranded regions, adopting an approximately he- 
lical conformation (27). 

The mapping results for SLVI as observed via extension of 
primer 2 (Fig. 3C) show strong single-strand hits at nt 321 to 
325, which represent 5 of the 10 nt in the predicted terminal 
loop. Strong double-strand hits were present at nt 317 to 319, 
a region predicted to be in the upper stem. 

Thus, RNase probing with T, and CV1 RNases overall 
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FIG. 4. Effects of SLVI mutations on accumulation of BCoV DI 
RNA. A. Mutations used to test the cis-requirement of SLVI for DI 
RNA replication. B. Northern analysis to measure the accumulation of 
wt and SLVI mutant DI RNAs. The *’P-labeled oligonucleotide probe 
specific for the 30-nt TGEV reporter sequence in the DI RNAs was 
used. RNA, ~1 ng of transcript used for transfection. C. Translation of 
wt and mutant DI RNAs. T7 RNA polymerase-generated transcripts 
of wt and mutant DI RNA were translated in rabbit reticulocyte lysate 
in the presence of *°S-labeled methionine, and the dried gel was 
autoradiographed. Lane 1, high-molecular-weight Rainbow marker 
proteins (Amersham); lane 6, translation product from a reaction 
mixture to which no RNA was added. 


yielded results consistent with the stem-loop V and VI struc- 
tures predicted by Mfold. 

Mutation analyses show that SLVI plays a functional role in 
DI RNA replication. Between SLV and SLVI, SLVI was cho- 
sen for testing a cis-replication function since it is relatively 
small and lends itself to translationally silent stem-disrupting 
nucleotide changes. It also demonstrated a high level of nucle- 
otide sequence conservation in its upper stem compared with 
other group 2 coronaviruses (46) (Fig. 2). 

To test the replication function of SLVI, base substitutions 
were made in the third (wobble) nucleotide positions of four 
base-paired codons in the upper stem that were predicted to 
disrupt helical structure but preserve wild-type amino acid 
sequence (Fig. 4A). In construct pLeftVI, substitutions 
315G—A and 318G-C were made, and following transfection 
of transcripts into virus-infected cells no DI RNA accumula- 
tion was observed at 24 and 48 h posttransfection, nor was 
replication observed at 24 h following the first passage of virus 
(Fig. 4B, LVI). Likewise, when mutations 333C->G and 
336C—T were made to form pRightVI, no replication was 
observed (Fig. 4B, RVI). However, when compensatory resto- 
ration of stem-loop VI was made by combining the mutations 
in pLeftVI and pRightVI, replicating ability was restored as 
evidenced by the accumulation of DI RNA at 24 and 48 h 
posttransfection and by replication following the first virus 
passage (Fig. 4B, L/RVI). Sequence analysis of cDNA clones 
made from VP1 RNA of the replicating L/RVI double mutant 
demonstrated that all four mutations were retained (data not 
shown). 


J. VIROL. 


Since the mutations made in SLVI resulted in codons used 
in the BCoV genome (57% of the time for GAA [Glu], 8% of 
the time for GUC [Val], 2.6% of the time [19 times total] for 
UCG [Ser], and 48% of the time for ACU [Thr]), it was not 
anticipated that blocks in DI RNA replication would result 
from failed translation of the mutated DI RNA ORFs. To 
establish that translation was not blocked by the mutations, 
transcripts of the mutant DI RNAs were translated in rabbit 
reticulocyte lysate, and the products were examined. The re- 
sults shown in Fig. 4C demonstrate that wt and all three mutant 
forms of DI RNA are translated to generate the fused DI RNA 
ORF product of approximately 62 kDa. The apparent larger 
size for this protein is due to its phosphorylation (data not 
shown). 

From these combined results, it was concluded that stem- 
loop VI, as a higher-order structure, is a required cis-acting 
element for BCoV DI RNA replication. 

Whether SLV functions similarly remains to be determined. 


DISCUSSION 


In the current study, we report the existence of a higher- 
order RNA structure identified as SLVI within the nsp1 coding 
region of the coronaviral genome that appears to act directly in 
cis for DI RNA replication. SLVI maps within a part of the 
nsp1 coding region found in all naturally occurring coronavirus 
DI RNAs (3, 4, 8) and in one synthetic MHV-A59 DI RNA 
(30) described to date, suggesting that a structure analogous to 
SLVI may be required for the replication of these DI RNAs as 
well. In at least one other instance, an apparently directly 
acting higher-order cis-replication structure has been found to 
map within the coding region of a coronavirus DI RNA. In that 
case, the structure, a 55-nt stem-loop derived from the nsp 3 
ORF, exists in a subset of multipartite DI RNAs generated 
from MHV-JHM (36). Other cis-acting RNA structures have 
been mapped within the coding region of the coronavirus ge- 
nome but appear to function indirectly in genome replication 
(29). These include the ribosome frameshifting pseudoknot 
located just downstream of the ORF 1a/1b junction (5, 13), the 
(usually) intergenic core sequence (also called the transcrip- 
tion regulatory sequence) associated with RdRp template 
switching during formation of ssmRNA templates (50), and a 
genome packaging signal that maps within ORF 2b (11). A 
cis-replication element has been described within a coding 
region of an arterivirus (2, 44), a virus group that along with 
coronaviruses, toroviruses, and roniviruses is a member of the 
Nidovirus group. This cis-acting structure is in the 3’-proximal 
region of the arterivirus genome, however, and to date appears 
not to have a homolog in the coronavirus genome. 

Whether SLVI is the only directly acting cis RNA replication 
element within the 5’-terminal region of ORF 1a remains to be 
determined. SLV is a complex stem-loop that shows consider- 
able structural conservation among the group 2 coronaviruses 
but has yet to be examined for a cis-replication function. Cu- 
riously, like SLVI, the terminal loop of SLV appears to be the 
most variable part of its structure. The proximity of SLV and 
SLVI suggests they may be components of a single larger 
cis-acting structure. 

It also remains to be determined what role the protein mot- 
ety of the partial nsp1 product plays as a cis-acting feature, if 
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any. It was initially thought that translation of the partial ORF 
la sequence in the BCoV DI RNA is also required for repli- 
cation (7). Replacing the AUG start codon for the partial nsp1 
ORF in DI RNA with a UAG stop codon resulted in no DI 
RNA replication, even when the N portion of the DI RNA 
ORF remained open (7). The notion that translation of the 
partial nsp1 ORF is required for DI RNA replication has been 
supported by two additional experiments in which the amino 
acid composition within short regions of the partial nsp1 prod- 
uct was changed and tested. In the first mutant, a nucleotide 
deletion at position 223 (causing a —1 frameshift) and a nu- 
cleotide insertion at position 304 (causing a +1 frameshift) 
caused amino acids 5 through 31 to change from 5NKYGLE 
LHWAPEFPWMFEDAEEKLDNP31 in the wt DI RNA to 
STNTVSNYTGLONFHGCLRTORRSWITL31 in the mu- 
tant. In the second mutant, a nucleotide deletion at position 
270 (causing a —1 frameshift) and a nucleotide insertion at 
position 304 (causing a +1 frameshift) caused amino acids 20 
through 31 to change from 20MFEDAEEKLDNP31 in the wt 
DI RNA to 20CLRTQRRSWITL31 in the mutant. Both of 
these mutations map within the region of SLV. Neither mutant 
replicated (data not shown). Thus, it could be that a feature of 
the translated partial nsp1 product fulfills a cis-requirement for 
replication, but it cannot be ruled out at this time that the 
nucleotide insertions and deletions altered a required cis-act- 
ing RNA structure. 

To date, nsp1 has been the most characterized in MHV and 
SARS-CoV. In MHV, nsp1 is a 28-kDa protein, also called 
p28, the first proteolytically cleaved product of the polyprotein 
synthesized from ORF 1a (12). When p28 is expressed in the 
absence of the other viral proteins, it has profound effects on 
cell metabolism, including the induction of cell cycle arrest at 
G,/G, (10). Interestingly, by using reverse genetics it was 
shown that the C-terminal region of p28 is not absolutely 
required for virus replication but that deletion of the N-termi- 
nal region is lethal for the virus (6). Expression of the SARS- 
CoV nsp1 protein alone in cells promoted host mRNA degra- 
dation (23). 

The identification of cis-replication elements in the large 
coronavirus genome remains a technical challenge but is 
now approachable with the use of reverse genetics systems 
(reviewed in reference 29). An important caveat when using 
DI RNAs for this identification is that such elements might 
be idiosyncratic for the DI RNA genome. One example of 
this is the coronavirus common octamer (GGAAGAGC) in 
the 3’ UTR which functions as a cis-replication element in 
DI RNA but not in the full-length genome (15). On the basis 
of the work presented here, we suggest that the presence of 
a cis-acting element for genome replication mapping within 
the 5’-proximal region of nsp1 may be one reason the 5’- 
proximal region of nsp1 cannot be deleted from the intact 
MHV genome (6). The presence of a cis-acting element 
within the partial nsp1 coding region along with SLIII and 
-[V in the BCoV 5’ UTR could also explain why it is that the 
semRNAs produced by this virus, although they are found in 
double-stranded RNA-synthesizing transcriptive intermedi- 
ates (1, 37), are not replicating molecules as once postulated 
(17, 39, 40). 
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